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Abstract.  It^^hown  that  Ca(±)  +  do(D(A})  +  ir(D(A))  =  fl(/C  +  Q)log(/C  -f  Q)  and 
+  do(D(A))  =  Q(K  -f-  Q)  log(/C  +  Q)  in  the  cases  of  DFT,  vector  convolution,  and 
matrix  mutliplication.  Here  Ca(Q)  is  the  additive  complexity  of  a  (bi)lincar  algorithm 
A  for  the  given  problem,  D(A)  and  0(A)  are  the  two  acyclic  digraphs  that  represent  A, 
each  of  them  is  obtained  from  another  one  by  reversing  directions  of  all  edges,  ir(D)  and 
do(D )  are  two  numbers  that  are  introduced  to  measure  the  structural  deficiencies  of  an 
acyclic  digraph  D.  K  and  Q  are  the  numbers  of  outputs  and  input-variables.  do(D(A))t 
do(D(A)\  and  ir(D(A))  characterize  the  logical  complexity  of  A.  Also  lower  bounds  on 
CU(i)  +  do(D{A))  and  on  j  are  expressed  in  terms  of  algebraic  quantities  such  as 

the  ranks  of  matrices  and  of  multidimensional  tensors  associated  with  the  problems.  — 
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L  Introduction. 

The  design  and  analysis  of  arithmetic  algorithms  for  Discrete  Fourier  Transform,  DFT, 
convolution  of  vectors,  CV,  cyclic  convolution  of  vectors,  CCV,  and  matrix  multiplication, 
MM,  are  among  the  central  problems  of  algebraic  complexity  theory,  cf.  [1-4].  In  this 
paper  we  analyze  the  arithmetic  and  logical  complexity  of  such  algorithms.  We  focus  on 
the  classes  of  linear  and  bilinear  arithmetic  algorithms  (see  the  definitions  in  Sections  2,  3 
where  we  basically  follow  [1-4]).  There  are  three  reasons  for  that  choice.  First,  the  linear 
and  bilinear  algorithms  are  natural  ones.  It  is  easier  to  analyze  and  to  implement  them. 
Secondly,  the  fastest  known  arithmetic  algorithms  for  DFT,  CV,  CCV,  and  MM  are  linear 
and  bilinear  ones.  Thirdly,  it  is  known  that  all  arithmetic  algorithms  for  a  linear  or  bilinear 
arithmetic  computational  problems  such  as  DFT,  CV,  CCV,  and  MM  can  be  turned  into 
linear  or  respectively  bilinear  algorithms  for  the  same  problem  and  the  total  number  of 
arithmetic  operations  in  the  algorithm  remains  the  same  up  to  a  constant  factor,  cf.  [3, 
pp.  34-38,  117]. 

For  the  analysis  of  the  arithmetic  complexity  of  linear  and  bilinear  algorithms,  it  is 
technically  convenient  to  estimate  separately  C(i),  C(*),  and  C5C,  the  total  numbers  of 
additions/subtractions,  nonscalar  multiplications,  and  scalar  multiplications  respectively 
involved  in  the  algorithm.  This  can  be  considered  the  first  step  of  the  study  followed  by 
the  estimates  for  the  bit-complexity  of  the  algorithms,  cf.  [5].  So  far  the  further  progress 
in  the  area  has  been  blocked  by  the  two  following  problems.  Estimate 

i)  Cmin(*)  for  MM  (cf.  [6,7]), 

and 

ii)  Cmln(±)  for  DFT,  CV,  and  CCV  (cf.  [4,  8-11]). 

In  this  paper  we  seek  for  lower  bounds  on  Cm\ n(zt),  the  additive  complexity  of  a  given 
(bi)linear  problem,  that  is  on  the  minimum  C/\(=j=)  over  all  (bi)liucar  algorithms  A  for  that 
problem.  (This  also  had  led  us  to  the  study  of  logical  complexity  of  algorithms.)  We  focus 
on  the  cases  of  DFT,  CV,  CCV,  and  MM. 

Despite  the  long  and  intensive  study  of  the  problem  (cf.  [1-4]),  all  known  lower 
bounds  on  <7min(i)  still  rely  on  the  methods  cf  substitution  due  to  [12-14];  cf.  also  [15], 
consequently  the  lower  bounds  range  between  K  and  K  +  Q.  Here  Q  and  K  are  the 
total  numbers  of  input  variables  and  respectively  of  (liuearly  independent)  outputs  of  the 
algorithm.  However,  those  methods  themselves  by  their  nature  cannot  give  even  the  bound 
K  +  Q  +  1.  For  comparison  rc  t  the  best  known  upper  bounds  on  C(±)  have  the 

asymptotic  order  ( K  +  (?)Iog(#  ,  m  the  cases  of  DFT,  CV,  and  CCV,  cf.  [8,9],  and 
(K  -f  Q)a,  a  ^  5/4,  in  the  case  of  MM,  cf.  [6,7]  (a  in  [7]  is  less  than  [6]  by  ^  0.01 
but  the  approach  of  [7]  substantially  uses  the  earlier  designs  whose  origin  is  surveyed  in 
detail  in  [6]  while  in  the  otherwise  successful  paper  [7]  the  confusing  name,  “Schonhagc's 
construction”,  does  not  help  the  reader). 


So  far  the  lower  bound  K  +  Q  +  1  on  the  C’(=j=)  has  been  obtained  for  neither  of  the 
problems  (DPT,  CV,  CCV,  MM)  unless  the  model  of  computation  is  seriously  restricted  as 
in  [16]  where  the  bound  f l(K  +  Q)  \og(K  +  Q)  was  established  by  an  ingenious  method 
but  only  under  quite  a  strong  requirement  that  the  constants  of  the  algorithms  be  bounded 
by  1. 

In  this  paper  we  do  not  impose  any  restrictions  on  our  (bi)lincar  model  of  computation. 
In  Section  5. 1-5.3  we  associate  (bi)linear  algorithms  A  with  acyclic  digraphs  D  =  D{A ). 
Their  vertices  have  outdegrees  2  and  0  and  represent  the  ±  operations  of  A  and  the 
input-variables  of  A  respectively.  Then  equals  the  number  of  vertices  of  D  that 

have  outdegree  2.  In  order  to  estimate  that  number,  we  try  to  partition  all  vertices  of  D 
into  p(D)  levels  ( p(D),  the  profundity  of  D,  is  defined  in  Section  5.2)  whose  cardinalities 
are  bounded  from  below  by  %  the  number  of  linearly  independent  outputs  of  the  problem. 
Then 

P(D)r  <  CA(±). 

However,  the  desirable  partition  might  not  exist.  In  that  case  we  show  that  D  must  have 
some  defects  in  its  structure.  To  be  more  precise,  we  formally  define  ir(D)  -j-  do[D ),  a 
measure  for  structural  deficiency  of  an  arbitrary  acyclic  digraph,  D  =  ( VtK ).  We  fix 
Lo(i9),  a  basic  set  of  vertices  of  D.  (Lq(D)  could  be  the  set  of  all  vertices  of  D  of  indegree 
0  but  for  D  =  D{A)  we  always  choose  Lq(D)  as  the  set  of  outputs  of  A.)  Then  we  say 
that  regularly  all  vertices  of  outdegree  0  (inputs  of  A)  are  equally  (and  hence  maximally) 
removed  from  L0(D)  (in  terms  of  the  longest  paths  from  L0(D)  to  the  vertices  of  outdegree 
0).  In  the  general  case  D  can  be  irregular.  Then  we  estimate  ir(D),  the  total  deviation  of 
vertices  of  outdegree  0  from  those  farthest  from  Lq(D)  positions.  The  ir{D)  measures  the 
irregularity  of  D  and  will  be  called  the  irregularity  number  of  D. 

The  disorder  generated  by  an  edge  {u,v)  from  u  to  i>,  u  7^  v,  is  defined  as  the 
difference  between  the  positive  lengths  of  the  longest  and  the  shortest  directed  paths  from 
u  to  v.  The  disorder  around  v  is  the  maximum  (for  all  u)  disorder  generated  by  the  edges 
(u,  v)  with  the  head  v.  The  disorder  number  of  D  is  the  total  disorder  around  all  its 
vertices. 

Both  numbers,  ir{D)  and  do(D)  measure  the  deficiencies  of  the  logical  structure  of 
the  algorithm  A  represented  by  the  digraph  D{A).  (Their  study  might  also  be  of  interest 
itself.) 

In  Section  5.3  we  show  that  both  deficiencies  can  be  corrected  by  a  simple  transfor¬ 
mation  of  D.  This  transformation  relics  on  joining  some  appropriate  paths  to  D.  In  the 
result  of  the  transformation  all  new  vertices  have  outdegree  1  and  other  vertices  do  not 
change  their  outdegrees.  If  Z>  =  D(A ),  such  transformation  basically  consists  in  arrang¬ 
ing  ir(D)  +  do(D)  additional  units  to  store  some  inputs  and  intermediate  results  of  the 
algorithm  A.  We  prove  that  totally  ir[D)  +  do[D)  new  vertices  of  outdegree  1  suffice 
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to  define  a  transformation  of  that  kind  that  turns  D  into  a  new  digraph  D  such  that 
jr(Z>)  =  do(D)  =  0  and  p(D)  =  p{D).  Then  the  desired  partition  of  D  into  levels  exists 
and  defines  the  lower  bound,  p(D)r,  on  the  number  of  vertices  of  the  resulting  digraph  D 
that  have  nonzero  outdegrees.  Yet  the  latter  number  equals  Ca(±)  +  do{A)  +  tr(D(A)) 
and  can  be  considered  the  sum  of  additive  and  logical  complexities  of  A.  (The  terms  do(A) 
and  tr(A)  can  be  alternatively  interpreted  as  the  measures  for  additional  storage.)  In  the 
cases  of  DFT,  CV,  CCV,  and  MM,  p{D)r  =  Q(K  +  Q)\og{K  +  Q)  (see  Theorem  5.3.1 
and  Equations  (5.3.6)— (5.3.8)  in  Section  5.3).  This  defines  nonlinear  lower  bounds  on  the 
^A(i)  +  do(D(A))  +  ir(/?(A))  which  are  sharp  up  to  a  constant  factor  in  the  cases  of 
DFT,  CV,  and  CCV. 

This  narrows  the  class  of  possible  choices  for  the  designs  of  fast  algorithms  for  the 
latter  problems  and  also  measures  the  sacrifices  in  the  logical  structure  of  such  algorithms 
which  are  necessary  if  the  algorithms  exist. 

In  the  remaining  sections  (5.4  and  6)  we  study  the  lower  bounds  on  the  C>v(i)  + 
do[D(A))  and  In  Section  5.4  we  prove  that  CA(±)+do(-£>(A))  >  1  MmO*) 

where  s(/i)  equals  2^  or  2h  —  1,  p  is  the  matrix  of  the  coefficients  of  the  given  (bi)linear 
problem,  and  rp(p)  is  the  p-rank  of  p.  (The  p-rank  of  a  matrix  p  is  defined  in  Section 
4  as  a  generalization  of  the  conventional  rank,  r  =  ro(p).)  We  conjecture  (Proposition 
4.1)  that  for  some  constant  a,  0  <  a  <  1 ,  we  have  that  rp(p)  =  fi(/C  +  Q)  for  p  = 
(K  -f-  Q)a  in  the  cases  where  p  is  the  matrix  associated  with  DFT,  CV,  CCV,  and  MM. 
We  do  not  really  attack  that  hard  and  interesting  problem.  Instead  we  establish  other 
bounds  on  rp(p)  and  hence  linear  lower  bounds  on  Ca(±)  +  do(A)  that  exceed  K  +  Q 
(sec  Theorem  5.4.2  in  Section  5).  Also  we  prove  (see  Corollary  5.4.2  in  Section  5.4)  that 
at  the  very  least  any  design  of  an  algorithm  A  for  DFT,  CV,  CCV,  or  MM  such  that 
C*A(i)"Fdo(A)  =  o(K  +0)  lo g(K  -f-Q)  must  be  a  counterexample  to  our  conjecture  (that 
is  to  Proposition  4.1). 

Finally  we  observe  that  we  can  reverse  the  directions  of  edges  of  D[A)  and  choose  the 
set  of  input- variables  of  A  as  the  basic  set  L0{D)  of  the  new  digraph,  D(A).  Then  our 
construction  give  the  lower  bound  CA(rt)  T*  do(b{A))  =  H(/C  +  Q)log(/C  +  Q)  m 
cases  of  DFT,  CV,  CCV,  and  MM. 

In  Section  6  we  estimate  Ca(Ji)  from  below.  (Assume  that  the  increase  of  the  disorder 
number  of  algorithms  is  not  prohibitive.)  We  combine  the  approach  of  [16]  with  our 
extension  of  the  concept  of  tensor  ranks  (cf.  [1,  p.  488]  or  [17],  sec  also  [18,  19]).  We  define 
r(P(X)),  the  rank  of  an  arbitrary  polynomial  P(X),  and  in  particular,  r(m),  the  rank  of 
any  minor  m  of  the  matrix  p(X),  that  defines  a  bilinear  computational  problem.  Then 
we  show  that 

CU(±)  >  tog2  r(m) 

for  any  bilinear  algorithm  ^4  for  such  a  problem  and  for  any  minor  of  n{X).  This  leads 


to  open  problems  of  estimating  maxm  log2  r(m)  or  other  related  algebraic  quantities.  The 
material  of  Sections  2,  3,  5.1,  6  is  self-contained. 


2.  Linear  and  bilinear  arithmetic  computational  problems. 

Hereafter  we  assume  that  all  linear  and  bilinear  forms  that  we  consider  are  homoge¬ 
neous,  unless  otherwise  stated  and  that  all  logarithms  are  to  the  base  2.  We  use  the 
following  notation. 

Notation.  /,  J ,  K ,  i,  j,  k  are  nonnegative  integers,  <p  is  a  vector,  /i  is  a  matrix,  r  is 
a  (3-dimensional)  tensor,  detjx  is  the  determinant  of  fi.  (/i)tJ  is  the  entry  of  fx  lying  in 
row  i  and  column  j ,  (<£>),  is  the  entry  i  of  vector  <p.  Similarly,  the  entries  of  tensor 

r  are  defined.  £(p),  b(<p)  is  a  linear,  respectively  bilinear  form  of  indetermiuates  (<p)i  for 
all  i  with  the  coefficients  from  F.  F  is  a  field  of  constants.  |Y|  and  [ij  are  the  two 
nearest  integers  such  that  [z]  <  x  <  \x].  X(u)  is  o(0(u))  if  X(u)/0(v)  — ►  0  for  v  — ►  oo. 
X(v)  is  O(0{y )),  respectively  fi(0(t/))  if  there  exist  two  constants  c0  >  0,  cx  >  0,  such 
that  X(v)  <  c0d(v)  for  |v|  >  C\}  respectively  X(v)  >  co0(u)  for  |v|  >  c,  X(v)  ~  0(u)  if 
simultaneously  X(t>)  is  0(0(u))  and  0(u)  is  0(X(u)).  X>  Y_y  Z  are  vectors  of  indeterminates, 
Xi  =  (X)«,  Vx  =  (X)j,  Zk  =  (Z)k.  xx  =  =,  Zk  =  0  unless  0  <  i  <  /,  0  <  j  <  /, 

0  <  k  <  K .  €  F}  fijk  6  F  for  all  z,  7,  A:.  If  5  is  a  set  then  |S|  is  the  cardinality  of 

S. 

Definition  2.1.  A  linear  (arithmetic  computational)  problem  is  a  set  of  linear  forms, 

J—  1 

tk{Y)  =  53  /,•*»,•  for  =  0, 1,  . .  .,K  —  1 .  (2.1) 

J  —  o 

A  bilinear  (arithmetic  computational)  problem  is  a  set  of  bilinear  forms, 

/ — l  J—i 

bk(X,  Y)  =  2  ^  for  k  =  0,  . . . ,  K  -  1 .  (2.2) 

»  =  0  j — 0 

Also  a  linear  problem  can  be  equivalently  represented  by  the  bilinear  form,  b(Y^f  Z),  or  by 
the  matrix,  such  that 


fC-l 

b(Y,  Z)=  53  tk{Y)zk ,  (M),fc  =  /Jfc  for  all  * .  (2.3) 

fc=0 

A  bilinear  problem  can  be  equivalently  represented  by  the  trilinear  form,  t(X_,Y,Z), 
by  the  3-dimensional  tensor,  r,  and  by  the  matrices  /j(X),  p(Y),  fi(Z),  such  that  for  all 
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*'»  j,  k 

K  —  1 

t(X,  Y,Z)=  £  bk(X,  Y)zk ,  (r)ijk  =  fijk  ,  (2.4) 

wi))„  =  S  /<**< . 

t  =  0 

J— 1 

WO)*-  £/«»*.  (2  5) 

J-0 

K-l 

wax,  =  £ 

fc=0 

Here  are  some  examples  of  linear  and  bilinear  problems  where  m ,  n  are  given  non¬ 
negative  integers.  We  represent  a  linear  problem  by  6(X,  Y)  and  /z  and  bilinear  problems 
by  t(X_,YyZ_)  and  /i(X).  The  transiton  to  the  representations  (2.1),  (2.2)  can  be  easily 
done  with  formulas  (2.3)-(2.5). 

Problem  2.1.  (DFT,  Discrete  Fourier  Transform  in  n  +  1  points). 

b(X_,£)  —  o  Zfc  ^kyj •  (/*)jfc  “  u  is  a  primitive  (n-f  l)-root  of  unity. 

Problem  2.2.  (CV,  convolution  of  two  vectors,  or  polynomial  multiplication. 
t{X,Y,Z)  =  £r=o”  Zfc  EjLo  xk—]Vj  ■  {n{X))jk  =  xk-j,  where 

Xi  =  0  if  i  <  0  or  i  >  m ,  yj  =  0  if  j  >  n .  (2.6) 

Remark  2.1.  The  matrix  /i(X)  of  CV  is  a  particular  case  of  a  general  (n  -f- 1)  X  (ra  n  -f- 1) 
Toeplitz  matrix,  T(2£);  (T(X))jk  =  j>  j  =  0, 1,  . . . ,  n,  k  —  0, 1,  . . . ,  m  +  n.  That 
particular  case  is  defined  by  the  following  relations. 

xs  —  0  for  5  <  0  and  for  s  >  m . 

CV  is  the  problem  of  the  evaluation  of  the  coefficients  of  the  polynomial  in  X,  g(X)r(X) 
where  the  coefficients  of  q(\)  =  XiX*  and  r(X)  =  X*7  are  given. 

Problem  2.3.  (CCV,  Cyclic  convolution  of  vectors  or  multiplication  of  two  n-the  degree 
polynomials  modulo  Xn+l  —  1). 

n  k  "k 

<(£XZ)  =  £  +  E  *E-,V,). 

(M(2C))>fc  =  Ifc-i  >  f  j  <k,  (m(X))j/c  =  Xk_.  otherwise.  (2.7) 
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Here  j,  k  =  0,1,  . . . ,  n,  /c  =  A:  -|-  n  -f- 1 .  (2.7)  defines  n[X_)  as  the  general  (n  1)  X  (n  + 1) 
circulant  matrix  that  is  the  general  (n  +  1)  X  (n  -j-  1)  Tocplitz  matrix  (cf.  Remark  2.1), 
where  n  =  Xj+n-|_i  for  i  <  0. 

Problem  2.4.  (MM,  matrix  multiplication.  (/  =  J  =  K  =  n2 .) 

For  this  problem  we  represent  the  vectors  X_ ,  Y,  Z  as  the  matrices  X,  Y,  Z,  so  that 
(2C).  =  P0a/3,  {Y)j  =  ( Y)fil ,  {Z)k  =  ( Z)1Q ,  i  —  an  +  /?,  j  =  p  +  qw,  k  =  7«  -f  a, 
a,/?,q  =  0,1 . n—  1. 


n  —  1 

t{X,  Y,Z_)  =  xo0V0i^a  =  Tr(XYZ) , 

a, 0,7  =  0 

(p(X))  =  (^)a/3  if  j  ~  /3  +  qn,  A:  =  qn  -f  a  for  some  a,  /?,  q, 

(h[2Q)  =  0  otherwise. 

Here  Tt{XYZ)  is  the  trace  of  the  matrix  XYZ. 

Remark  2.2.  (2.8)  defines  n(X)  as  the  block-diagonal  n2  X  n2  matrix  where  each  of  n 
diagonal  blocks  is  the  matrix  X . 


3.  Linear  and  bilinear  arithmetic  algorithms. 

Given  a  vector  of  input-variables  (indeterminates),  V  =  (vj,  .  ..,vq),  and  a  field 
of  constants,  F,  then  an  arithmetic  algorithm  A  is  defined  as  a  sequence  of  arithmetic 
operations,  (alt  ...,ac)  such  that 


a,  =  as(ag(s),ah{s)),  g[s)  <  s ,  h[s)  <  s ,  s=l,  ...,C,  (3.1) 

each  aa(u,w)  is  either  u  +  to,  or  u  —  w,  or  uw,  or  u/w,  g(s),  h(s )  are  integers,  ai_<  =  Vi 
for  t  =  1,  . . . ,  Q,  Oi  6  F  for  i  <  — Q.  Given  a* ,  a  subset  of  the  set  {ai,  . .  .,ac}>  then 
A  is  said  to  evaluate  a *  over  F  and  to  have  the  complexity  C. 

We  will  study  arithmetic  algorithms  that  evaluate  given  sets  of  linear  forms,  {£jt(H)}, 
then  V  =  Y,  Q  =  Jt  cf.  (2.1),  or  of  bilinear  forms,  {6/c(2C Y)},  then  V  = 

Q  =  I  +  J,  cf.  (2.2).  In  those  cases  wc  consider  the  natural  classes  of  linear  and  bilinear 
algorithms.  An  arithmetic  algorithm  (3.1)  is  called  linear  if  for  all  s  either 

as(u,  w)  —  uw  and  ag(s)  £  F ,  (3.2) 

or 

ad(u,  w)  =  u  -j-  w  or  a^u,  w)  —  u  —  w  . 


i 


1 

m 


i 
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(3.3) 


Operations  (3.2)  and  (3.3)  are  called  scalar  multiplications  and  additious/subtractions  (or 
operations  ^)  respectively.  Notice  that  for  all  s  the  as  are  linear  forms  in  V  in  the  case 
of  linear  algorithms. 

An  arithmetic  algorithm  (3.1)  is  called  bilinear  if  it  consists  of  the  following  four 
successive  steps. 

Step  1,  Linear  algorithm,  A\  where  the  set  of  input- variables  is  X- 

Step  2.  Linear  algorithm,  A2  where  the  set  of  input-variables  is  Y\ 

Step  3.  Multiplication  algorithm,  A3  =  (ai,  where  as  =  ag^s)a^s)y  s  = 

1,  all  ay(5)  are  outputs  of  Step  1,  and  all  are  outputs  of  Step 

2.  The  operations  of  Step  3  are  called  nonscalar  multiplications. 

Step  4.  Linear  algorithm,  A4  where  the  set  of  input-variables  is  the  set  of  outputs  of 

Step  3.  (If  it  is  allowed  to  extend  the  sets  of  input-variables  of  each  of  Steps 

1  and  2  to  (X,  T),  then  Steps  1-4  are  said  to  define  a  quadratic  algorithm. 

Quadratic  algorithms  are  not  considered  in  this  paper.) 


4.  The  height  of  a  linear  form  and  the  p-rank  of  a  matrix. 

In  this  section  we  define  two  auxiliary  concepts  (height  and  p-rank). 

Definition  4.1.  H{t ),  the  height  of  f(T),  a  linear  form  of  Y  (respectively  of  fi(X,  T),  a 
bilinear  form  of  X,  H),  with  the  coefficients  from  F,  is  the  number  of  nonzero  coefficients 
of  the  given  form.  H  and  H ,  the  height  and  respectively  the  lower  height  of  a  (bi)linear 
problem  are  the  maximum  and  respectively  the  minimum  heights  of  the  (bi)linear  forms 
to  be  evaluated. 

Notation.  Cy\(i)  is  the  number  of  ^  operations  involved  in  an  algorithm  A.  Ct(±)  is 
the  minimum  C/\(i)  for  all  linear  algorithms  A  that  evaluate  a  linear  form  f’(T)* 

The  next  obvious  lemma  enables  us  to  estimate  C^db)- 

Lemma  4.1.  Let  1\{Y),  IsQQ  be  linear  forms  ofY  over  F .  Let  l[Y)  =  /i^i(T)  +  /2^2(H); 
where  h,h£F.  Then  H(i)  <  H{lx)  + 


Corollary  4.1,  cf.  [12].  Ct(±)  =  H(t)  —  1. 

The  next  three  definitions  will  introduce  another  auxiliary  concept,  the  p-rank  of  a 
matrix. 

Definition  4.2.  Two  linear  forms  arc  p-aeighbors  if  the  height  of  their  difference  is  at  most 

P. 
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Definition  4.3.  Given  two  sets  of  linear  forms,  {7jt(H)}  and  {^.(H)},  and  the  two  associated 
matrices  of  their  coefficients,  p  and  p! ,  then  p  and  p'  are  p-neighbors  iff  for  each  k  the 
two  linear  forms  /fc(Y)  and  i fk(Y_ )  are  p-neighbors,  or  equivalently  iff  the  matrix  p!  —  /l i 
has  at  most  p  nonzero  entries  in  each  column. 

Definition  4.4.  rp(p),  the  p-rank  of  a  matrix,  p,  is  the  minimum  rank  of  its  p-neighbors. 
rp(p(X))>  the  p-rank  of  a  matrix  p(X),  is  max^o  rp(p(X°)). 

Remark  4.1.  Similar  but  different  definitions  could  be  obtained  if  “row”  substitutes  for 
“column”.  r0(p)  is  the  conventional  rank  of  p. 

As  is  obvious,  an  arbitrary  m  X  rn  matrix,  p,  has  a  p-neighbor  with  at  least  p  zero- 
rows,  so  that  rp(p)  <  m  —  p.  On  the  other  hand,  for  a  given  r,  the  testing  f  rp(p)  <  r 
can  be  reduced  to  the  solution  of  a  Gnite  number  of  systems,  each  of  (m  —  r)n  algebraic 
equations  for  (m  —  r)r~{~np  variables.  (Fix  r  candidates  for  r  basic  rows  among  the  finite 
number  of  possible  choices  and  try  to  express  other  m  —  r  rows  as  linear  combinations 
of  basic  ones.)  This  suggests  that  (m  —  r)n  <  (m  —  r)r  +  np  and  hence  (m  +  n  —  r)r  > 
(m  —  p)n.  If  m  ~  n,  p  =  7 n,  r  =  /?n,  then  (2  —  /?}/?  >  1  —  7. 

However,  we  do  not  see  any  way  to  turn  the  above  preliminary  estimates  into  the  lower 
bounds  on  rp(p)  and  rp(p(X))  in  the  specific  cases  where  p  and  p(X))  are  the  matrices 
of  Problems  2. 1-2.4,  Those  who  look  for  hard  and  interesting  problems  are  encouraged  to 
try  to  prove  or  disprove  the  following  proposition. 

Proposition  4.1.  There  exists  a  constant  a,  0  <  a  <  1  such  that  p  <  nQ  implies  that 

rp(/x)  =  fl(n) ,  rp(n{X_))  =  H(n) 

if  p  is  the  matrix  of  Problem  2.1  and  p(X)  is  the  matrix  of  Problem  2.2  (where  m  ~  n) 
or  2.3.  If  p  <  nQ  and  p(X)  is  the  matrix  of  Problem  2.4  then 

rp(n(X))  =  n(n2)  • 

In  Section  5.4  we  show  how  to  use  Proposition  4.1  (if  it  is  true)  in  our  study  of  linear 
and  bilinear  algorithms,  and  how  even  weaker  lower  estimates  for  rp(p)  and  rp(p(X))  can 
be  helpful. 


5.  Algorithms  as  digraphs  and  the  lower  bounds  on  the  sum  of  the  additive  and  logical 
complexities. 

We  partition  this  long  section  into  four  subsections. 


5.1.  (Bi)linear  algorithm  as  a  sequence  of  ^  operations.  Assume  that  the  indeterminants 
(2C)i  for  all  i  are  replaced  by  their  instances  taken  from  F .  Then,  of  course,  each  bilinear 
algorithm  turns  into  a  linear  one  and  Ca{ ±)  does  not  increase.  (We  can  assume  that  the 
instances,  (X)i,  are  always  chosen  such  that  C^dt)  takes  its  maximum  value.)  Hence  the 
lower  bounds  on  the  Cmin(jj  over  the  class  of  such  linear  algorithms  are  also  the  lower 
bounds  on  the  Cmjn(ih)  over  all  bilinear  algorithms  for  Problems  2.2  2.4,  In  this  and  in 
the  next  sections,  the  words  “Problems  2. 2-2. 4”  will  be  freely  used  as  a  shorthand  for  “the 
linear  instances  of  Problems  2. 2-2.4”  where  this  does  not  cause  confusion. 

We  will  count  only  Jr  operations.  Thus  we  rewrite  an  arbitrary  linear  algorithm,  cf. 
(3.1)  (3.3),  in  a  more  convenient  way  for  our  analysis,  that  is,  as  (a^L  j,  a2- -J>  •  *  *  i  aCA(±))> 
a  sequence  of  linear  forms  in  the  input-variables,  Y_ ,  where 

aT  =  +  f'ianU)  ’  "W  <  3  -  »?(»  <  3  - 

/,eF,  /'6F,  j  =  1,2,  ...,CA(±),  (5.1.1) 

=  for  j  =  0, 1,  ...,J  —  1 ,  (5.1.2) 

at  7 for  all /GF,  s^q.  (5.1.3) 

Notice  that  C^(dd)  equals  the  number  of  all  a^~  in  (5.1.1)  where  j  >  1. 

Hereafter,  we  assume  that  the  is  defined  up  to  a  constant  nonzero  factor  /  £  F, 

so  that  in  fact  each  is  the  set  {/ ,  /  €  F}  of  proportional  linear  forms  in  y  where 

//(a  +  )  >1  iff  j  >  1,  H(af)  <  ff(a+:))  +  H(a+}))  for  all  ;  >  l . 


5.2,  Algorithms  as  digraphs  partitioned  into  levels.  Irregularity  numbers.  We  associate  A, 
an  algorithm  (5.1.1)  (5.1.3),  with  D(A)  =  (V’(A),  /2(A)),  an  acyclic  directed  graph,  where 
v^l)  =  {af,j  =  l-  J,2-  J,  ...,CA{±)},  E{A)  =  {«,<,)),(<,%+)),;  = 

1.2,  .  ..,C>v(i)}.  A  vertex  a +  has  outdegree  2  if  j  >  0  and  outdegree  0  if  j  <  0.  The 
Ca(ic)  ‘s  equal  to  the  number  of  vertices  of  V(A)  of  outdegree  2.  This  motivates  our 
upcoming  study  of  acyclic  digraphs. 


Definition  5.2.1.  Given  an  acyclic  digraph  D  =  ( V ,  E)  then  a  (directed)  path  in  D  of 
length  s  >  0  is  a  pair  of  chains  of  edges  {(v0,vl),(vi,v2),  ...,(«/,_ t,  va))  and  of  vertices 
(v0,ui,  .  ..,v3)  where  [vh,Vh+i)  G  E  for  all  h.  (All  paths  are  directed  and  have  positive 
lengths.)  A  set  V  C  V  circles  v  G  V  iff  any  directed  path  from  v  either  enters  V ,  or  can 
be  continued  to  V .  The  subset  V,q  of  V*  C  V  consists  of  all  vertices  of  V’  of  outdegree 
q.  Let  Lq{D )  be  a  hxed  (basic)  subset  of  V .  The  shortest  path  level,  Lq(D)  and  the  longest 
path  level,  LLq{D)  arc  the  two  subsets  of  V  such  that  o  G  L<,(D)  (resp.  v  G  LLq{D))  iff  each 
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shortest  (resp.  each  longest)  path  from  Lq(D)  to  v  consists  of  q  edges.  In  this  case  q  =  d(v) 
(resp.  q  —  r(u))  is  the  distance  (resp.  the  remove)  from  L0(D)  to  v.  d{D)  =  max^y  d(v) 
is  the  depth  of  D.  p[D)  —  inaxvev'  »"(w)  is  the  profundity  of  D.  PLq(D)  —  Pq(P>)  \  V° 
and  PLLq(D)  —  LLq(D)  \  are  the  subsets  of  Lq(D)  and  LLq(D)  resp.  They  consist 
of  the  vertices  of  positive  outdegrees.  The  set  Lq(D)  =  Lq[D)  U4<,  (LS{D))°  is  the  level 
Lq{D)  extended  by  joining  the  vertices  of  outdegree  0  from  the  previous  levels  of  D. 

Now  we  can  write  the  following  basic  relation, 

d(D)-l  p(D)  -  1 

Ca(±)=  ]T  \PLq(D)\  =  £  \PLLq(D)\  (5.2.1) 

« = 0  q  0 

if  D  —  D(A)  represents  a  linear  algorithm  A. 

Remark  5.2.1.  YVe  always  choose  the  set  of  outputs  of  A  as  the  basic  subset  L0(D)  if 
D  =  D(A). 

We  start  with  lower  estimates  for  \Lq{D)\.  Such  estimates  will  follow  as  ;\  corollary 
from  the  next  two  lemmas. 

Lemma  5.2.1.  Given  an  acyclic  digraph  D  —  (V ,  E)  with  a  basic  subset  L0[O)  C  V,  a 
positive  integer  q,  and  a  vertex  v  t  /.0(D).  Then  the  set  Lq{D)  circles  v. 

Remark  5.2.2.  If  Lq(D)  had  been  defined  as  LLq[D)  U,< ,  (LLS(D))°  then  Lemma  5.2.1 
would  not  have  held.  (For  some  digraphs  D,  (iQ(D))°  and  LL\{D)  are  empty,  IJ^(D)  is 
not.) 

Lemma  5.2.2.  Let  a  digraph  D  =  0(A)  =  (F(yl),  E(A) )  represent  a  linear  algorithm 
A;  cf.  (5.1.1)-  (5.1.3).  For  an  integer  j  and  for  a  set  of  integers  S,  all  from  the  closed 
interval  (l  —  J,CA(±)),  let  the  subset  {a+,s  6  S }  of  V(A)  circle  a G  V(A).  Then 
at  =  f*eF  for  alls. 


Corollary  5.2.1.  Let  a  digraph  D  =  D(A)  =  (V(A),  E(A))  represent  a  linear  algorithm  A 
for  a  linear  problem  with  matrix  p,  cf.  Remark  5.2.1.  Then 

\~l'q(D)\  >  r0(n)  forq  =  0, 1 . d(D(A))  —  1 . 

Here  r0(p)  is  the  rank  of  p. 

Proof.  r*o(/i)  is  the  number  of  linearly  independent  outputs.  That  number  is  not  less 
than  |A,(Z1)|  by  Lemmas  5.2.1  and  5.2.2.  | 
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Next  we  estimate  Ca(±)  using  (5,2.1)  and  the  following  definition. 

Definition  5.2.2.  An  acyclic  digraph,  D  —  (V,  E)  is  shortest  (res.  longest)  path  regular  one 
iff  all  its  vertices  of  outdegree  0  belong  to  the  level  L^(d)(D)  (resp.  LLp(p)(D)).  The  two 
numbers,  ir*(D)  =  zr#(u)»  *r(^)  =  2r(v)>  are  l^e  shortest  and  resp.  the 

longest  path  irregularity  numbers  of  D.  Here  ir*(v)  —  d(D)  —  d(u),  tr(u)  =  p{D)  —  p(v). 

Remark  5.2.3.  D  can  always  be  transformed  into  a  shortest  (resp.  longest)  path  regular 
acyclic  digraph  by  joining  the  path  defined  by  the  chain  of  vertices  (t^u),  /z  =  0,1,2,  . . . ,  s) 
to  each  v  £  V°.  Mere  i>o(u)  =  v  and  s  is  the  length  of  the  chain,  s  =  ir*(v)  (resp.  s  = 
ir(i/)).  Totally  ir*[D)  (resp.  ir{D))  new  vertices  and  as  many  new  edges  are  to  be  joined 
to  D  during  the  regularization  process. 

Combining  Definition  5.2.2,  Corollary  5.2.1,  and  Equation  (5.2.1),  we  obtain  the 
following  estimate. 

Corollary  5.2.2.  Under  the  conditions  of  Corollary  5.2.1,  C^dt)  +  ir*(D(A))  > 
r0(n)d(D(A)). 

The  latter  estimate  itself  gives  little  guidance  to  the  designers  of  the  fast  linear 
algorithms.  Indeed,  C.a(±)  =  O(r0(n))  implies  that  r0(n)d(D(A))  —  it' (D(A))  =  0(r,(/i)) 
but  in  general  d(D(A))  can  be  as  small  as  2  for  any  linear  problem.  (Indeed,  given  an 
algorithm  A  (cf.  (5.1.1))  then  for  a  6  Lq[D(A)),  f  €  F,  j  >  0,  /,  ^  0,  substitute  the  next 
three  ±  steps  for  (5.1.1):  a„(,)  =  a+(j)  +  fjO.,  a,  =  fjav[j)  +  /Jojyj,  a+  =  a,  —  f~f}a. 
Choose  a,  fj  such  that  as  f  a+  for  all  q  and  for  s  =  j,  s  =  i/(j).) 

In  the  next  subsection  we  present  a  more  meaningful  lower  estimate. 

5.3.  Ordered  levels,  disorder  numbers  and  lower  bounds. 

Definition  5.3.1.  The  levels  of  D  =  (V,E)  arc  ordered  up  to  the  level  LLq(D)  if  r(v)  = 
r(u)  +  1  for  any  edge  (tx,v)  G  E  such  that  r(v)  <  q.  The  levels  of  D  are  ordered  if  they 
are  ordered  up  to  the  level  LLq[D)  where  q  =  p[D).  doq{D )  is  the  disorder  number  of 
{ LLS(D ),  s  =  0,l . q)  if 

doq(D)  =  2Z  d°{v) ,  do(v)  =  max  do(u,  u) ,  do(u,v)  —  r(v)  —  r(u)  —  1 .  (5.3.1) 

V 

Here  the  maximization  is  for  all  u  such  that 


and  the  summation  is  for  all  v  such  that  r(v)  <  q .  do(D)  =  dop^(D)  is  the  disorder 
number  of  D.  do(utv)  is  the  disorder  generated  by  the  edge  (u,  v)  and  do(v)  is  the  disorder 
around  v. 

The  next  two  simple  lemmas  and  corollary  indicate  the  way  to  apply  the  digraphs 
with  the  ordered  levels  to  our  main  problem. 

Lemma  5.3.1.  LS(D)  =  LLS{D)  for  s  =  0,1,  ... tq  if  the  levels  of  D  are  ordered  up  to 
the  level  LLq(D).  In  particular ,  if  q  —  p(D)  then  p(D)  —  d(D),  ir(D)  =  ir*(D). 


Lemma  5.3.2.  Under  the  conditions  of  Corollary  5.2.1 ,  p(D(A))  >  log  H  where  H  is  the 
height  of  the  linear  problem  solved  by  A.  (To  prove,  apply  Lemma  4.1.) 


Corollary  5.3.1.  Let  a  digraph  D  =  D(A)  with  ordered  levels  represent  a  linear  algorithm 
A  for  a  linear  problem  with  matrix  p.  Let  H  designate  the  height  of  the  problem  (cf. 
Definition  4.1).  Then 

d(D(A))  >  log  H  ,  CA(±)  +  ir(D{A))  >  r0(p)  log  H  . 

(Corollary  5.3.1  follows  from  Lemmas  5.3.1,  5.3.2,  and  Corollary  5.2.2.) 

Next  we  will  show  how  to  order  the  levels  of  an  arbitrary  acyclic  digraph  and  will 
estimate  the  cost  of  such  ordering. 

Lemma  5.3.3.  For  any  q  >  0  an  acyclic  digraph  D  =  ( V ,  E)  can  be  transformed  into  a 
digraph  Dq  =  (Vq,Eq)  whose  levels  are  ordered  up  to  the  level  LLq[Dq)  and  such  that 

v\^V*  for  tV  1 ,  V1^1,  |^|  =  |^|4-^,(D).  (5.3.3) 

In  particular,  Dp(d)  —  D  =  {V,E)  has  ordered  levels  and 

V'  =  Vi  fort  ^1,  |V’l|  =  |VlH-(fo(r>).  (5.3.4) 

Proof.  Let  r(v)  <  q.  Let  U(v)  designate  the  set  of  all  vertices  u  £  V  such  that  (5.3.2) 
holds  and  E(v)  designate  the  set  of  edges  (u,v)  of  E  with  the  head  v.  Then  substitute 
the  following  subdigraph  D(v)  =  (V(v),E(v))  for  the  E(v);  cf.  (5.3.1). 

V(t>)  =  (t)fc(v)  ,fc  =  1,2,  ... ,  do(v)} . 

E(v)  =  {(vt(v)  >  v»-t(v))  for  s  =  1, 2,  . . . ,  do(v) ,  (u,  v<t0(u,v){v))  for  all  u  6  U{v)} 
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where  t)o(v)  =  v. 

The  desired  digraph  Dq  =  (V",,  Eq)  results  if  such  substitutions  are  done  for  all  v  6  V 
such  that  r(v)  <  q.  Indeed,  (5.3.3)  is  easily  verified,  as  well  as  the  following  property:  for 
each  edge  (u,v)  £  E  such  that  r(v)  <  q  in  D,  the  shortest  directed  path  in  Dq  from  u 
to  v  has  the  length  do(u,v)  -f- 1  and  has  only  the  vertices  of  outdegree  1  except  u  and  v. 
This  implies  that  r(v)  =  r(u)  -f  1  for  all  (u,v)  £  Eq  such  that  r(v)  <  q.  | 

Remark  5.3.1.  The  remove  of  a  common  vertex  v  of  V  and  V  is  the  same  in  D  and  D. 
By  the  virtue  of  Lemma  5.3.1,  it  equals  d( v)  in  V.  In  particular,  it  follows  that 

p(D)  =  p(D)  =  d(D)  >  d(D) ,  ir[D)  =  ir(D)  =  tr*(D)  >  ir*(D) . 

Combining  the  transformations  of  D  of  Remark  5.2.3  and  of  the  proof  of  Lemma 
5.3.3,  we  turn  D  into  D,  a  longest  path  regular  digraph  with  ordered  levels.  By  the 
virtue  of  Lemma  5.3.1  and  Remark  5.3.1,  D  is  also  a  shortest  path  regular  digraph. 
Combining  Remarks  5.2.3  and  5.3.1,  Corollary  5.3.1,  and  Lemma  5.3.3  we  obtain  the 
following  estimate. 

Theorem  5.3.1.  Let  a  digraph  D  —  D(A)  represent  a  linear  algorithm  A  associated  with 


matrix  p.  Let  H  be  the  height  of  the  problem.  Then 

Ca(±)  +  do(D(A))  +  ir{D(A))  >  r0(p)  log  H  .  (5.3.5) 

As  is  easily  verified, 

r0(p)logff  =  (n  -f  1)  log(«  +  1)  for  Problems  2.1,  2.3,  (5.3.6) 

ro(p)  log  H  —  (m  +  n  -|-  1)  log  max(m  +  1,  n  1)  for  Problem  2.2,  (5.3.7) 

ro(p)  log  11  =  2n2  log  n  for  Problem  2.4.  (5.3.8) 


On  the  one  hand,  relations  (5.3.5)-(5.3.7)  enable  us  to  narrow  the  search  of  new  faster 
algorithms  for  DFT,  CV,  CCV.  On  the  other  hand,  even  if  such  algorithms  exist,  they 
must  have  some  deficiencies  in  their  logical  structure  that  are  quantitatively  represented 
by  the  sum  do(D(A))-\-ir(D(A)).  The  increase  of  ir[D(A))  might  seem  a  lesser  evil.  This 
motivates  our  next  study  of  the  lower  bounds  on  CU(;t)  +  do(D(A))  based  on  the  two 
simple  lemmas  presented  in  the  beginning  of  the  next  subsection. 
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5.4.  Lower  bounds  on  Ca(±)  +  do(D(A))  in  terms  of  the  p- ranks  of  the  matrix  of  the 
problem. 

Lemma  5.4.1.  Let  all  vertices  of  an  acyclic  digraph  D  have  outdegrees  0,  1,  or  2.  Let 
v  G  Lq(D)  and  let  q  >  0  be  an  integer.  Then  for  some  s  <  2q  there  exist  s  vertices  from 
Lq(D)  (cf.  Definition  5.2.1 )  that  circle  v. 

Proof.  Apply  Lemma  5.2.1.  The  bound  2  on  the  outdegrees  implies  that  at  most  2** 
paths  from  v  to  Lq(D)  exists.  | 

Lemma  5.4.2.  Let  q  be  a  nonnegative  integer  and  D  —  D(A)  represent  a  linear  algorithm, 
A.  Let  v  G  Lo{D)  be  an  output  of  A  (which  is  a  linear  form  in  Y).  Then  there  exists  an 
h-neighbor  of  v  (cf.  Definition  4.2)  that  can  be  represented  as  a  linear  combination  of  g 
vertices  of  positive  outdegrees  in  D  which  are  elements  of  Lq(D)  and  linear  forms  in  Y, 
such  that  log(g  +  h)  <  q.  If  the  height  of  v  (cf.  Definition  4.1)  is  greater  than  2q  then 
g  >  1. 

Proof.  Apply  Lemmas  5.2.2  and  5.4.1.  Represent  v  as  f*v*  where  (cf.  Definition 

5.2.1)  vt  G  U’=0 {Ls(D))°  for  t  =  l,  . . . ,  h,  vx  G  PLq(D)  for  i  =  h  +  1,  . . . ,  h  +  g. 

Recall  that  all  vt  and  v  are  linear  forms  in  Y_  with  coefficients  from  F  and  notice 
that  the  height  of  v,  equals  1  for  t  =  1,  . . .  ,h.  Hence  v  —  fiv%  =  YLi~h+ 1  f'v »  *s 

an  A-neighbor  of  v .  If  g  =  0  the  height  of  v  equals  h  <  2q.  | 

Corollary  5.4.1.  Let  a  digraph  D  =  D(A)  represent  a  linear  algorithm  for  a  problem  with 
matrix  p  (cf.  Remark  5.2.1).  Then  for  any  integer  s  >  0, 

iPLAO)\>rhM(M),  (5.4.1) 

h(s)  =  2a  —  1  if  the  heights  of  all  outputs  of  A  are  greater  than  2s ,  (5.4.2) 

h(s)  —  2a  otherwise.  (5.4.3) 

Remark  5.4.1.  For  s  =  0  the  estimate  (5.4.1)  can  be  improved  as  follows, 

\PL0(D)\>c(p).  (5.4.4) 

Here  and  hereafter  c(p)  designates  the  number  of  noD proportional  outputs  of  the  heights 
greater  than  1  or  equivalently  the  number  of  nonproportional  columns  of  p  with  more 
than  1  nonzero  entries.  If  m  >  1,  n  >  1  then  c(p)  =  n  4-  1  for  Problems  2.1,  2.3, 
c(p)  =  m  -f-  n  —  1  for  Problem  2.2,  and  c(p)  ~  n2  for  Problem  2.4.  Bound  (5.4.1)  can 
also  be  strengthened  for  s  >  0.  We  leave  this  possibility  as  a  challenge  to  the  reader. 
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Let  a  digraph  D(A)  represent  a  linear  algorithm  A;  cf.  (5.1.1)— (5-1-3).  We  can  apply 
the  transformation  of  D(A)  from  Lemma  5.3.3  to  order  the  levels  of  D(A).  Then  combining 
Equation  (5.2.1),  Lemma  5.3.3,  Corollary  5  4.1,  and  the  bound  (5.4.4)  gives  the  following 
estimate. 

Theorem  5.4.1.  Let  a  digraph  D  =  D(A)  represent  a  linear  algorithm  A  for  the  problem 
with  a  matrix  p  (cf.  Remarks  5.2.1,  5.4.1).  Then  (cf.  (5.4.2),  (5.4.3))  for  all  q 

q  p(D) — 1 

Ca(±)  +  doq(D(A))  >  c{p)  +  £  rh(4)(p)  +  £  \PLL.(D)\.  (5.4.5) 


In  particular,  for  q  =  p(D)  —  l, 

P(D)-1 

CA(±)  +  do(D(A))>  c(p)  +  £  rh[a)(p)-  (5.4.6) 

3  =  1 

Those  readers  who  believe  that  Proposition  4.1  is  true  (why  not?)  will  probably  be 
happy  with  the  following  result  (cf.  (5.4.6). 

Corollary  5.4.2.  If  Proposition  4.1  holds  then  for  any  linear  algorithm  A  for  Problems  2.1- 
2.3,  C/i(i)  +  do(D(A))  =  ft(nlogn)  (assuming  that  m  ~  n  in  the  case  of  Problem  2.2) 
and  for  any  linear  algorithm  A  for  MM  (Problem  2.4),  C^(i)  +  do(D(A))  =  fl(n2  logn). 
Here  D(A)  is  the  digraph  that  represents  A. 

Thus  a  designer  of  an  algorithm  for  Problems  2.1-2.3  where  m  =  n,  Ca(±)  + 
do(D(A ))  =  o(nlogn)  must  at  least  disprove  Proposition  4.1. 

In  fact,  some  linear  lower  bounds  on  C>t(;L)  do(D(A))  that  exceed  K  -f-  Q  can  be 
established  immediately.  For  instance,  the  following  results  can  be  proven,  see  [20]. 

Theorem  5.4.2.  Let  A  be  an  arbitrary  linear  algorithm  for  one  of  the  Problems  2. 1-2.4. 
Then  the  following  lower  bounds  on  CU(rt)  +  do(A)  hold:  2 n  -f-  l.lq(n  -f- 1)  —  28  for  DFT, 
m-}-2.33n  -}-0.3min(m,n)  —  28  for  CV  (where  m  >  2,  n  >  2),  3.1(n+  1)  —  28  for  CCV, 
3.5n2  —  10.5n  +  8  for  MM  (n  >  2).  Here  q(s)  designates  the  maximum  prime  that  divides 
an  integer  s. 

In  the  next  section  we  reconsider  the  problem  of  lower  bounds  on  C^(i). 

As  follows  from  (5.3.5)-(5.3.8), 

CA(±)  =  n(/f  +  Q)log(/C  +  Q) 
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in  the  case  of  algorithms  for  DFT,  CV,  CCV  and  MM  whose  digraphs  are  regular  and  have 
ordered  levels.  When  the  paper  had  been  written,  Thomas  Spencer,  a  graduate  student 
at  Stanford  University,  Stanford,  California,  noticed  that  the  latter  bound  holds  even  if 
the  regularity  requirement  is  deleted.  He  also  raised  the  questions  if  the  term  tr(D(A))  in 
(5.3.5)  can  be  deleted  or  at  least  if  in  the  cases  of  DFT,  CV,  CCV  and  MM, 

K  +  Q  =  +  do(D(A)) . 

These  questions  remain  open.  Unsuccessfully  trying  to  answer  them,  the  present 
author  noticed  however  that  for  an  arbitrary  linear  algorithm  A  for  a  problem  of  the 
height  H  associated  with  the  matrix  ^  the  next  estimate  immediately  follows  from  our 
construction  of  Sections  5. 1-5.3. 

Ca(±)  +  do(D{A))  >  r0{n)  log  H .  (5.4.7) 

(Consequently  C a(±) do(D(a))  >  QlogQ  for  DFT,  CV,  where  m  =  n,  CCV  and  MM.) 
Here  D  =  D(A)  is  the  acyclic  digraph  obtained  from  D(A)  by  reversing  the  directions  of 
all  edges  of  D{A)  and  the  basic  set  Lq(D)  is  the  set  of  all  vertices  of  D  associated  with  the 
input-variables  of  A  or  equivalently  having  indegree  0  in  D.  In  this  case  the  basic  relation, 

d(D)  p(D) 

CA(±)^J2\Lq(b)\='£)LLq{D)\f 

<7  =  1  <7  =  1 

is  simpler  than  (5.2.1)  and  also  we  can  substitute  Lg{D)  for  Lq(D)  in  Lemma  5.2.1.  This 
allows  us  to  dispense  with  the  regularization  of  D.  The  meanings  of  the  numbers  do(D(A)) 
and  do(D(Aj)  are  similar  although  those  numbers  can  be  different. 


6.  Lower  bounds  on  C(i)  in  terms  of  the  rank  of  determinant 

Hereafter  L(2C  F1)  designate  the  set  of  all  linear  forms  in  X_  with  coefficients  from  a 
field  F.  (The  set  L(X_,F)  includes  the  elements  of  F .) 

Definition  6.1.  If  Pn{K)  =  Pn(2C i,  ...,Xn)  *s  an  ^-linear  form  in  n  vectors  of  indeter¬ 
minants,  2£i>  •  •  •  >2£n  then  R(Pn{X-)),  the  polylinear  rank  of  Pn(X),  equals  (cf.  [1,  p.  488] 
or  [17])  the  minimum  integer  R  such  that 

R  n 

II  WE'Ot.'i'  (e.i) 

g=l  fc=l 
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If,  more  generally,  Pn{X_)  is  any  polynomial  of  degree  n  in  X  then  r(Pn(X)),  the  rank  of 
Pn(X_)>  >s  the  minimum  integer  r  such  that 

r  n 

Pn{X)  =  £  J[Lgh(X),  Lgh(X)eL(X,F).  (6.2) 

0  =  1  1 

For  a  matrix  p  =  p(X)  with  the  entries  from  L{X_,F)t 

r(/i)  =  max  r(m)  (6.3) 

mtzD 

(where  D  =  £>(p(X))  is  the  set  of  all  minors  of  p(X))  is  the  rank  of  the  bilinear  problem 
associated  with  p(X). 

Equations  (6.1)  and  (6.2)  imply  that 

r(Pn(X))<R{Pn{X)).  (6.4) 

The  R(P2{Xx,X2))  is  equal  to  the  rank  of  the  matrix  associated  with  a  bilinear  form 
Pi(Xx,X2).  R(P3(Xx,X2,X3))  is  equal  to  the  multiplicative  complexity  of  the  three 
bilinear  computational  problems  associated  with  /^(Xu  X2,X3)  (cf.  [18,  19]). 

Next  we  will  modify  the  construction  of  [16]  to  show  how  the  additive  complexity  of 
the  bilinear  problem  associated  with  a  matrix,  /i,  is  related  to  the  rank  r(/i)  (cf.  (6.3)). 

In  Section  2  we  associated  linear  forms  in  Y_  (whose  coefficients  arc  from  L(F,X)) 
with  the  column-vectors  of  their  coefficients.  An  5-tuple  of  such  vectors  forms  a  7  X  s 
matrix.  In  particular,  the  sequence  of  linear  forms  a~^,  ;  =  1  —  7,  2  —  7,  ...,(M(:Jr) 
(cf.  (5.1.1)),  as  well  as  the  associated  column-vectors  of  their  coefficients,  represents  an 
arbitrary  bilinear  algorithm.  The  matrix  fi{q)  that  consists  of  the  first  7  +  <7  columus 
represents  the  algorithm  up  to  its  q th  +  operation. 

The  set  of  the  matrices  {p{q) ,  q  =  1  —  7,  .•’,(?>*(+)}  must  satisfy  the  following 
properties. 

i)  /i(0)  is  the  7  X  7  identity  matrix. 

ii)  fi{q  -f  1)  =  (m(<?)  |  V{q  +  1))  for  q  =  0,1,  ...,CU(±)  —  1  where  V{j)  = 
fjV(v(jj)  +  f}V(r)(j)),  fj£F,  /'  e  F,  u(j)  <  j,  r)[j)  <  j,  Vl(s)  is  the  (J-fs)th 
column-vector  of  p(s);  cf.  (5.1.1). 

iii)  The  matrix  of  the  given  bilinear  problem  p(X)  is  a  submatrix  of  p(Ca(±)). 

Now  we  observe  (cf.  [16))  that  the  property  ii)  implies  the  next  identities  in  X  for  all 

q  >  0,  m(q  -j-  1)  =  /  m(q)  f'm'(q).  Here  m[q  -f-  1)  is  an  arbitrary  minor  of  fi{q  -}-  1), 
/  €  F,  /'  €  F,  and  m(q),  m'(q)  are  some  minors  of  n(q).  Hence  (cf.  (6.3)) 

r{n{q  +  0)  <  2r(/i(q))  for  q  —  0, 1 . Ca(±)  —  1 .  (6.5) 
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As  is  obvious, 


(6.6) 

(6.7) 


KMo))  =  i. 

MCa(±)))  >  r(p(X)). 

Equations  (6.5)  (6.7)  imply  the  following  result. 

Theorem  6.1.  Given  a  bilinear  algorithm,  A,  for  the  bilinear  problem  defined  by  a  matrix 
p  =  p(X).  Then  CA(±)  >  logr(p(X)). 

Remark  6.1.  If  one  proved  that 


/  =  o(log  r(p(X)))  (6.8) 

for  some  p(X),  a  fixed  I  X  I  matrix  with  the  entries  from  L(F,X)  (X  =  (xo,  . . . ,  x/_i)) 
(for  instance,  for  the  I  X  I  general  Toeplitz  matrix;  cf.  Remark  2.1)  then  Theorem 
6.1  would  imply  a  nonlinear  lower  bound  on  CA(-±)  for  any  algorithm  A  for  the  bilinear 
problem  defined  by  that  matrix  p(X).  However  inequality  (6.4)  and  the  bound 

log/?(per  p(X))  <  I  known  for  an  arbitrary  I  X  I  matrix  /z(X)  (H.  J.  Ryser,  cf.  (1,  p. 
497])  convince  that  it  is  very  hard  to  prove  (6.8)  if  possible  at  all.  Nevertheless  we  conclude 
with  the  following  challenging  problem:  find  an  appropriate  modification  of  the  definition 
of  r(p(X))  such  that  Equations  (6.5)-(6.8)  hold. 
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